Earth Mover's Distance Yields Positive Definite Kernels For Certain Ground Distances
نویسندگان
چکیده
Positive definite kernels are an important tool in machine learning that enable efficient solutions to otherwise difficult or intractable problems by implicitly linearizing the problem geometry. In this paper we develop a set-theoretic interpretation of the Earth Mover’s Distance (EMD) that naturally yields metric and kernel forms of EMD as generalizations of elementary set operations. In particular, EMD is generalized to sets of unequal size. We also offer the first proof of positive definite kernels based directly on EMD, and provide propositions and conjectures concerning what properties are necessary and sufficient for EMD to be conditionally negative definite. In particular, we show that three distinct positive definite kernels – intersection, minimum, and Jaccard index – can be derived from EMD with various ground distances. In the process we show that the Jaccard index is simply the result of a positive definite preserving transformation that can be applied to any kernel. Finally, we evaluate the proposed kernels in various computer vision tasks.
منابع مشابه
Supervised Earth Mover's Distance Learning and Its Computer Vision Applications
Earth Mover’s Distance (EMD) is an intuitive and natural distance metric for comparing two histograms or probability distributions. We propose to jointly optimize the ground distance matrix and the EMD flow-network based on partial ordering of histogram distances in an optimization framework. Two applications in computer vision are used to demonstrate the effectiveness of the algorithm: firstly...
متن کاملFeature Space Interpretation of SVMs with non Positive Definite Kernels
The widespread habit of “plugging” arbitrary symmetric functions as kernels in support vector machines (SVMs) often yields good empirical classification results. However, in case of non conditionally positive definite (non-cpd) functions they are hard to interpret due to missing geometrical and theoretical understanding. In this paper we provide a step towards comprehension of SVM classifiers i...
متن کاملMetric-Preserving Reduction of Earth Mover's Distance
We prove that the earth mover’s distance problem reduces to a problem with half the number of constraints regardless of the ground distance, and propose a further reduced formulation when the ground distance comes from a graph with a homogeneous neighborhood structure. We also propose to apply our formulation to the non-negative matrix factorization.
متن کاملEquivalence of Distance-based and Rkhs-based Statistics in Hypothesis Testing by Dino Sejdinovic,
We provide a unifying framework linking two classes of statistics used in two-sample and independence testing: on the one hand, the energy distances and distance covariances from the statistics literature; on the other, maximum mean discrepancies (MMD), that is, distances between embeddings of distributions to reproducing kernel Hilbert spaces (RKHS), as established in machine learning. In the ...
متن کاملEquivalence of Distance-based and Rkhs-based Statistics in Hypothesis Testing by Dino Sejdinovic, Bharath Sriperumbudur,
We provide a unifying framework linking two classes of statistics used in two-sample and independence testing: on the one hand, the energy distances and distance covariances from the statistics literature; on the other, maximum mean discrepancies (MMD), that is, distances between embeddings of distributions to reproducing kernel Hilbert spaces (RKHS), as established in machine learning. In the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1510.02833 شماره
صفحات -
تاریخ انتشار 2015